智能论文笔记

Memory-Augmented Reinforcement Learning for Image-Goal Navigation

Lina Mezghani , Sainbayar Sukhbaatar , Thibaut Lavril , Oleksandr Maksymets , Dhruv Batra , Piotr Bojanowski , Karteek Alahari

分类：计算机视觉

2021-01-13

在这项工作中，我们提出了一种用于图像目标导航的内存调格方法。早期的尝试，包括基于RL的基于RL的方法和基于SLAM的方法的概括性能差，或者在姿势/深度传感器上稳定稳定。我们的方法基于一个基于注意力的端到端模型，该模型利用情节记忆来学习导航。首先，我们以自我监督的方式训练一个国家安置的网络，然后将其嵌入以前访问的状态中的代理商的记忆中。我们的导航政策通过注意机制利用了此信息。我们通过广泛的评估来验证我们的方法，并表明我们的模型在具有挑战性的吉布森数据集上建立了新的最新技术。此外，与相关工作形成鲜明对比的是，我们仅凭RGB输入就实现了这种令人印象深刻的性能，而无需访问其他信息，例如位置或深度。

translated by 谷歌翻译

Risk-Sensitive Policy with Distributional Reinforcement Learning

Thibaut Théate , Damien Ernst

分类：机器学习 | 人工智能

2022-12-30

Classical reinforcement learning (RL) techniques are generally concerned with the design of decision-making policies driven by the maximisation of the expected outcome. Nevertheless, this approach does not take into consideration the potential risk associated with the actions taken, which may be critical in certain applications. To address that issue, the present research work introduces a novel methodology based on distributional RL to derive sequential decision-making policies that are sensitive to the risk, the latter being modelled by the tail of the return probability distribution. The core idea is to replace the $Q$ function generally standing at the core of learning schemes in RL by another function taking into account both the expected return and the risk. Named the risk-based utility function $U$, it can be extracted from the random return distribution $Z$ naturally learnt by any distributional RL algorithm. This enables to span the complete potential trade-off between risk minimisation and expected return maximisation, in contrast to fully risk-averse methodologies. Fundamentally, this research yields a truly practical and accessible solution for learning risk-sensitive policies with minimal modification to the distributional RL algorithm, and with an emphasis on the interpretability of the resulting decision-making process.

translated by 谷歌翻译

Greenhouse gases emissions: estimating corporate non-reported emissions using interpretable machine learning

Jeremi Assael , Thibaut Heurtebize , Laurent Carlier , François Soupé

分类：机器学习 | 人工智能

2022-12-21

As of 2022, greenhouse gases (GHG) emissions reporting and auditing are not yet compulsory for all companies and methodologies of measurement and estimation are not unified. We propose a machine learning-based model to estimate scope 1 and scope 2 GHG emissions of companies not reporting them yet. Our model, specifically designed to be transparent and completely adapted to this use case, is able to estimate emissions for a large universe of companies. It shows good out-of-sample global performances as well as good out-of-sample granular performances when evaluating it by sectors, by countries or by revenues buckets. We also compare our results to those of other providers and find our estimates to be more accurate. Thanks to the proposed explainability tools using Shapley values, our model is fully interpretable, the user being able to understand which factors split explain the GHG emissions for each particular company.

translated by 谷歌翻译

Training a Vision Transformer from scratch in less than 24 hours with 1 GPU

Saghar Irandoust , Thibaut Durand , Yunduz Rakhmangulova , Wenjie Zi , Hossein Hajimirsadeghi

分类：计算机视觉

2022-11-09

Transformers have become central to recent advances in computer vision. However, training a vision Transformer (ViT) model from scratch can be resource intensive and time consuming. In this paper, we aim to explore approaches to reduce the training costs of ViT models. We introduce some algorithmic improvements to enable training a ViT model from scratch with limited hardware (1 GPU) and time (24 hours) resources. First, we propose an efficient approach to add locality to the ViT architecture. Second, we develop a new image size curriculum learning strategy, which allows to reduce the number of patches extracted from each image at the beginning of the training. Finally, we propose a new variant of the popular ImageNet1k benchmark by adding hardware and time constraints. We evaluate our contributions on this benchmark, and show they can significantly improve performances given the proposed training budget. We will share the code in https://github.com/BorealisAI/efficient-vit-training.

translated by 谷歌翻译

Bilevel Optimization for Feature Selection in the Data-Driven Newsvendor Problem

Breno Serrano , Stefan Minner , Maximilian Schiffer , Thibaut Vidal

分类：机器学习

2022-09-12

我们研究了基于功能的新闻企业问题，其中决策者可以访问包括需求观察和外源特征组成的历史数据。在这种情况下，我们研究了功能选择，旨在得出具有改进样本外部性能的稀疏，可解释的模型。到目前为止，最新的方法利用正则化，这会惩罚所选特征的数量或解决方案向量的规范。作为替代方案，我们介绍了一种新型的双层编程公式。高级问题选择了一部分功能，这些功能将基于固定验证集的订购决策的样本外成本估算最小化。下层问题仅使用上层选择的功能，了解训练集中决策功能的最佳系数。我们为Bilevel程序提供了混合整数线性程序重新制定，可以通过标准优化求解器求解为最佳性。我们的计算实验表明，该方法准确地恢复了几百个观察结果的实例中的基础真相。相反，基于正则化的技术通常在功能恢复时失败，或者需要数千个观察值才能获得相似的准确性。关于样本外的概括，我们实现了改进或可比的成本绩效。

translated by 谷歌翻译

A Bayesian Variational principle for dynamic Self Organizing Maps

Anthony Fillion , Thibaut Kulak , François Blayo

分类：机器学习 | 神经与进化计算 | (统计)机器学习

2022-08-24

我们提出了组织条件，该条件在各种贝叶斯框架中产生了一种以适应性邻域半径训练SOM的方法。该方法在非平稳设置上进行了验证，并在高维设置中与其他适应性方法进行了比较。

translated by 谷歌翻译

Optimal precision for GANs

Thibaut Issenhuth , Ugo Tanielian , Jérémie Mary , David Picard

分类：机器学习 | 人工智能 | (统计)机器学习

2022-07-21

在学习断开分布时，已知生成对抗网络（GAN）面临模型错误指定。实际上，从单峰潜伏分布到断开连接的连续映射是不可能的，因此甘斯一定会在目标分布支持之外生成样品。这提出了一个基本问题：最小化这些领域的衡量标准的潜在空间分区是什么？基于几何测量理论的最新结果，我们证明，最佳甘恩必须将其潜在空间构造为“简单群集” - 一个voronoi分区，其中细胞是凸锥 - 当潜在空间的尺寸大于大于的数量时模式。在此配置中，每个Voronoi单元格映射到数据的不同模式。我们在gan学习断开的歧管的最佳精度上得出了上限和下限。有趣的是，这两个界限具有相同的减小顺序：$ \ sqrt {\ log m} $，$ m $是模式的数量。最后，我们执行了几项实验，以表现出潜在空间的几何形状，并在实验上表明gan具有与理论相似的几何形状。

translated by 谷歌翻译

Support Vector Machines with the Hard-Margin Loss: Optimal Training via Combinatorial Benders' Cuts

Ítalo Santana , Breno Serrano , Maximilian Schiffer , Thibaut Vidal

分类：机器学习

2022-07-15

由于其损耗函数的无限性，经典的铰链损耗支撑矢量机（SVM）模型对异常观测值敏感。为了解决这个问题，最近的研究集中在非凸损失函数上，例如硬质量损失，该损失将恒定的罚款与任何错误分类或细边样品内的样本相关联。应用此损失函数可为关键应用带来急需的鲁棒性，但它也导致NP硬化模型，这使训练变得困难，因为当前的精确优化算法显示有限的可伸缩性，而启发式方法无法始终找到高质量的解决方案。在这种背景下，我们提出了新的整数编程策略，这些策略可显着提高我们将硬利润SVM模型培训为全球最优性的能力。我们引入了一种迭代采样和分解方法，其中使用较小的子问题来分离组合弯曲器的切割。这些切割量在分支和切割算法中的使用，可以更快地收敛到全球最佳。通过对经典基准数据集的大量数值分析，我们的解决方案算法首次求解了117个新数据集，以达到最佳性，并在基准最困难的数据集的平均最佳差距中降低了50％。

translated by 谷歌翻译

An extensible Benchmarking Graph-Mesh dataset for studying Steady-State Incompressible Navier-Stokes Equations

Florent Bonnet , Jocelyn Ahmed Mazari , Thibaut Munzer , Pierre Yser , Patrick Gallinari

分类：机器学习 | 计算机视觉

2022-06-29

\ emph {几何深度学习}（GDL）的最新进展显示了其提供强大数据驱动模型的潜力。这提供了探索从图形数据中\ emph {部分微分方程}（PDES）控制的物理系统的新方法的动力。然而，尽管做出了努力和最近的成就，但几个研究方向仍未开发，进步仍然远非满足现实现象的身体要求。主要障碍之一是缺乏基准数据集和常见的物理评估协议。在本文中，我们提出了一个2-D Graph-Mesh数据集，以研究High Reynolds制度的机翼上的气流（从$ 10^6 $及以后）。我们还对翼型上的应力力引入指标，以评估重要的物理量的GDL模型。此外，我们提供广泛的GDL基准。

translated by 谷歌翻译

When adversarial attacks become interpretable counterfactual explanations

Mathieu Serrurier , Franck Mamalet , Thomas Fel , Louis Béthune , Thibaut Boissin

分类：人工智能 | 计算机视觉 | 机器学习 | (统计)机器学习

2022-06-14

我们认为，当学习一个具有最佳运输问题双重损失的1- lipschitz神经网络时，模型的梯度既是运输计划的方向，又是与最接近的对抗性攻击的方向。沿着梯度前往决策边界不再是对抗性攻击，而是反事实的解释，明确地从一个班级运输到另一个班级。通过对XAI指标进行的广泛实验，我们发现应用于此类网络的简单显着性图方法成为可靠的解释，并且在不受约束的模型上胜过最新的解释方法。所提出的网络已经众所周知，可以证明它们也可以通过快速而简单的方法来证明它们也可以解释。

translated by 谷歌翻译